Skip to content

Conversation

@alexcheng1982
Copy link
Contributor

In the current implementation of QuestionAnswerAdvisor, the user text is always used as the query sent to VectorStore's similaritySearch method. This may not be the case for some advanced RAG scenarios, where the original user query may be expanded, transformed, or rewritten to get the actual query sent to the vector store.

The new customizeSearchRequest method provides an extension point to customize the SearchRequest used by VectorStore's similaritySearch method. This allows subclasses of QuestionAnswerAdvisor to apply custom logic to update the SearchRequest.

…isor`

The new `customizeSearchRequest` method provides an extension point to further customize the `SearchRequest` used by `VectorStore` to search for similar documents.
* @return the customized {@link SearchRequest}
*/
protected SearchRequest customizeSearchRequest(SearchRequest searchRequest, AdvisedRequest request,
Map<String, Object> context) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is it necessary to inherit and reimplement the implementation to return a SearchRequest method, even though QuestionAndAnswerAdvisor is already receiving a SearchRequest object through the constructor?

@markpollack
Copy link
Member

@alexcheng1982 This is indeed a very important use case to address. Previous iterations of what became the Advisor approach better accommodated this, it in fact kept a history of the changes to the user message. Instead of a subclass approach, what if we make use of the context in the advisor and have a well known key which is 'transformedUserTextor some name like that. This way there can be a chain of advisors such that before theQuestionAnswerAdvisoradvisor executes, aQueryRewriteAdvisorexecutes and puts the new updated query under the keytransformedUserTextin the context. Then theQuestionAnswerAdvisor` would look in that area of the context first and use the text there if present and if not, fall back to the current impl.

Thoughts?

@youngmoneee
Copy link
Contributor

I’m implementing a proxy object to intercept requests from the QuestionAndAnswerAdvisor to customize a Similarity Search.

.prompt()
.advisors(spec -> spec
    .param(CHAT_MEMORY_CONVERSATION_ID_KEY, chatId)
    .param(SEARCH_REQUEST, searchRequest)
    .param(QUERY, queryString)
    // pass query or SearchRequest
)
.user(userMessage)

Wouldn't it be more natural to pass the SearchRequest object through AdvisorSpec rather than using inheritance?

request.advisorParams().getOrDefault(
        SEARCH_REQUEST,
        this.searchRequest
    )

@ThomasVitale
Copy link
Contributor

Using the new (experimental) RAG features, advanced strategies for query transformation/expansion are supported. More info in the documentation: https://docs.spring.io/spring-ai/reference/api/retrieval-augmented-generation.html#_retrievalaugmentationadvisor_incubating and https://docs.spring.io/spring-ai/reference/api/retrieval-augmented-generation.html#_pre_retrieval

@ilayaperumalg
Copy link
Member

Thanks @ThomasVitale for the input.
@alexcheng1982 Please re-open this if you still see the issue. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants